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INTRODUCTION 


IGF-I  is  a  central  hormone  in  the  regulation  of  anabolic  (growth)  processes  as  a  function  of 
available  energy  and  elementary  substrates  (e.g.,  essential  amino  acids),  and  has  strongly 
mitogenic  and  anti-apoptotic  activities.  Results  from  in  vitro  studies  and  animal  experiments 
show  that,  in  exeess,  the  anabolic  signals  by  IGF-I  can  promote  the  development  of  tumors  at 
various  organ  sites,  and  recent  epidemiological  studies  have  shown  an  increased  breast  cancer 
risk  in  women  with  elevated  serum  IGF-I,  or  with  elevated  levels  of  IGF-I  for  given  levels  of 
IGFBP-3,  the  major  plasmatic  IGF-binding  protein. 

While  nutritional  status  is  one  important  determinant  of  circulating  IGF-I  levels  (Kaaks  & 
Lukanova,  2001),  heritability  studies  have  shown  that,  in  well-nourished  populations,  a  large 
part  (40-60  %)  of  variation  in  IGF-I  is  (co)determined  by  genetic  factors  (Harrela  et  al.,  1996; 
Hong  et  al.  1996;  Verhaeghe  et  al.,  1996).  To  increase  understanding  of  what  are  the  major 
determinants  of  IGF-I  levels,  as  well  as  cancer  risk,  we  conduct  a  study  with  the  following 
objectives: 

1 .  Confirm  that  elevated  prediagnostic  serum  levels  of  IGF-I  increase  breast  cancer  risk, 
especially  in  premenopausal  women; 

2.  Describe  exhaustively  existing  polymorphisms,  allele  frequencies  and  haplotypes  in  15 
selected  genes  related  to  the  secretion  of  growth  hormone,  and  hence  to  the  synthesis  of 
IGF-I  and  IGFBP-3;  and 

3.  Examine  whether  these  genetic  polymorphisms  are  related  to  significant  increases  or 
decreases  in  circulating  levels  of  IGF-I  and  IGFBP-3,  as  well  as  in  breast  cancer  risk. 

Our  project  is  a  large  case-control  study  nested  within  the  European  Prospective  Investigation 
into  Cancer  and  Nutrition  (EPIC),  using  prediagnostic  blood  (serum  and  DNA)  samples 
collected  during  1992-1998,  from  233,800  women  in  western  Europe. 

As  mentioned  in  our  original  application,  the  study  was  planned  in  four  parts: 

1.  A  case-control  study  (about  1000  cases  and  1000  controls)  nested  in  a  prospective 
cohort,  to  estimate  the  associations  of  serum  IGF-I  and  IGFBP-3  levels  with  breast 
cancer  risk; 

2.  Preparation  of  an  exhaustive  catalogue  of  polymorphisms  and  haplotypes  in  the  1 5 
selected  candidate  genes,  and  a  (“phase- 1”)  association  study  on  a  subset  of  400 
controls  to  identify  genotypes  that  have  a  minimum  level  of  association  with  serum  of 
IGF-I  and  IGFBP-3; 

3.  A  nested  case-control  study,  to  estimate  relative  risks  of  breast  cancer  in  relation  only 
to  those  genotypes  selected  in  phase- 1; 

4.  A  (“phase-2”)  study  of  associations  of  these  selected  genotypes  with  IGF-I  and 
IGFBP-3,  in  all  cases  and  controls. 
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BODY 


For  Year  1.  our  workplan  (as  in  the  “Statement  of  Work”  section  of  our  application),  was  as 
follows; 

1 .  Selection  of  cases  and  controls,  using  the  established  eligibility  and  matching  criteria,  and 
extraction  of  case-control  data  sets  with  relevant  information  from  questionnaires  and 
anthropometry:  Task  1,  months  1-2. 

2.  Retrieval  of  serum  and  huffy  coat  samples  from  the  central  EPIC  storage  facility; 
assembly  of  the  serum  samples  into  batches  of  matched  case-control  sets  for 
immunoassays;  assembly  of  the  buffy-coat  samples  into  batches  for  DNA  extraction: 

Task  2,  months  2-4. 

3.  Assays  of  IGF-I  and  IGFBP-3  serum  of  breast  cancer  cases  (n  =  1000)  and  controls  (n  = 
1000):  Task  3,  months  7-12. 

4.  DNA  extraction  for  all  2000  cases  and  controls:  Task  5,  months  1-12. 

5.  Preparation  of  an  exhaustive  catalog  of  polymorphisms  by  searching  the  literature,  and  by 
DHPLC  analysis  of  DNA  from  a  subset  of  200  subjects:  Task  6:  months  1-12. 

As  described  in  our  previous  report  (Year  1),  we  achieved  most  of  these  goals  in  Year  1, 
whereas  for  some  other  tasks  we  were  in  advance  of  our  planned  schedule.  The  following  list 
provides  a  quick  recapitulation  of  key  accomplishments  in  Year  1: 

a)  Case-control  selection,  DNA  extraction,  and  £rowth  factor  measurements'. 

An  almost  complete  selection  of  breast  cancer  cases  (n=810)  and  control 
subjects  (n=1620)  within  the  EPIC  cohorts,  for  the  nested  case-control  study 
on  serum  IGF-I,  IGFBP-3  and  breast  cancer  risk; 

Measurement  of  IGF-I  and  IGFBP-3  for  these  cases  and  controls; 

DNA  extraction  for  these  cases  and  controls; 

b)  Preparation  of  SNP  2enotvpins  tools: 

Identification  of  a  comprehensive  catalogue  of  >1 20  (possible)  single 
nucleotide  polymorphisms  (SNPs)  in  all  candidate  genes  included  in  the 
present  study; 

Preparation  of  a  DNA  genotyping  chip,  for  78  of  these  polymorphisms  that 
had  confirmation  of  being  true  polymorphisms; 

c)  Preliminary  senotypim  study. 

A  first  descriptive  study  of  SNP  and  haplotype  frequencies  for  all  candidate 
genes,  in  a  population  of  477  subjects  (women  plus  men);  and 
A  preliminary  analysis  of  associations  of  SNPs  and  circulating  levels  of  IGF-I 
and  IGFBP-3  (corresponds  to  Task  7,  originally  planned  for  Year  2); 

Selection  of  a  first  series  of  23  SNPs  to  be  typed  in  the  full  case-control  study, 
using  the  Taqman  assay. 
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During  Year  2.  we  completed  some  of  the  tasks  originally  planned  for  Year  1 .  hi  particular, 
we  included  274  cases  of  breast  cancer  and  496  control  subjects,  for  whom  questionnaire 
information  available  in  2002  did  not  allow  proper  control  matching  (this  information  arrived 
during  2002).  Thus,  total  numbers  of  breast  cancer  cases  and  control  subjects  were  brought  to 
1084  and  2116,  respectively. 

For  these  additional  study  subjects,  DNA  was  extracted  from  their  stored  huffy  coat 
specimens,  and  additional  serum  measurements  of  IGF-I  were  performed.  For  IGFBP-3, 
measurements  were  delayed  because  of  technical  difficulties  with  the  (commercial)  assay, 
due  to  complications  at  the  manufacturing  site  (DSL,  TX,  USA).  In  collaboration  with  the 
company,  these  problems  have  now  been  solved,  and  completion  of  the  IGFBP-3  assays  is 
scheduled  for  September-October  2003. 

Tasks  planned  for  Years  2  and  3  (“Statement  of  Work”  section  of  our  study  proposal)  were: 

6.  Statistical  analysis  of  the  nested  case-control  study,  relating  breast  cancer  risk  to 
prediagnostic  serum  levels  of  IGF-I  and  IGFBP-3  (Task  4,  months  13-24); 

7.  Complete  genotyping  of  a  subset  of  200  breast  cancer  cases  and  200  control  subjects; 
Statistical  analysis  of  phase- 1  association  study,  relating  genotypes  to  serum 
concentrations  of  IGF-I  and  IGFBP-3  (Task  7;  months  12-24);  and 

8.  Genotyping  of  the  remaining  1600  cases  and  controls,  for  a  limited  set  of 
polymorphisms,  selected  in  phase  1 ; 

Statistical  analysis  of  associations  between  polymorphisms,  serum  levels  of  IGF-I  and 
IGFBP-3,  and  breast  cancer  risk;  Writing  of  reports  (Task  8;  months  24-36). 

All  of  the  goals  up  to  the  end  of  Year  2,  and  much  of  the  goals  for  Year  3,  have  been  met: 

(a)  Statistical  analysis  of  breast  cancer  risk  in  relation  to  prediagnostic  serum  levels  of 
IGF-I  and  IGFBP-3  (Task  4): 

Although  IGFBP-3  measurements  are  not  complete  yet  (see  above),  we  performed 
extensive  statistical  analyses  of  the  relationships  of  breast  cancer  risk  with  serum  IGF- 
I  and  IGFBP-3  concentrations.  These  analyses  showed  no  clear  relationship  of  breast 
cancer  risk  with  IGF-I  among  women  who  were  premenopausal  at  the  time  of  blood 
donation,  contrary  to  findings  from  some  previous  cohort  studies.  Among 
postmenopausal  women,  however,  there  was  a  moderate  association  of  risk  with  IGF-I 
(relative  risk  of  about  1.5  between  the  extreme  quintiles  of  the  IGF-I  exposure 
distribution). 

(b)  Further  SNP  genotyping  of  breast  cancer  cases  and  control  subjects  (Task  8): 

For  all  1084  breast  cases  and  2116  controls  included  in  th  nested  case-control  study,  a 
first  series  of  23  SNPs  were  typed  by  Taqman.  For  an  additional  number  of  SNPs, 
genotyping  assays  by  the  Taqman  technique  did  not  work  (a  normal  observation  with 
this  type  of  assay);  for  these,  we  are  currently  trying  to  set  up  alterantive  assays  (e.g., 
“Amplifluor”).  For  yet  another  set  of  SNPs,  our  DNA  chip  initially  used  for 
genotyping  in  a  smaller  subset  of  individuals  (total  of  78  SNPs)  did  not  give  valid 
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results,  and  we  were  thus  unable  not  confirm  whether  or  not  polymorphic  alleles  were 
prevalent  enough  to  make  their  measurement  worthwhile.  For  this  latter  subset,  we 
have  now  decided  to  type  all  of  these  SNPs  by  Taqman  or  the  Amplifluor  technique. 

In  total,  some  18  SNPs  remain  to  be  typed  for  all  case  and  control  subjects,  which  will 
bring  the  total  number  of  SNPs  typed  to  41 .  This  total  number  of  assays  (SNPs)  is 
considerably  larger  than  that  foreseen  in  our  original  study  proposal;  however, 
genotyping  of  this  increased  number  largely  has  become  possible  due  to  the 
progressive  reductions  in  genotyping  reagent  costs. 

(c)  Statistical  analysis  of  SNP  /  haplotvpe  associations  with  IGF-I  and  IGFBP-3  (Tasks  7 
and  8): 

In  Year  1,  performed  preliminary  statistical  analyses  were  performed  to  examine 
possible  associations  of  SNPs  and  their  haplotypes  with  serum  concentrations  of  IGF-I 
and  IGFBP-3,  within  a  first  series  of  477  subjects  (see  our  previous  report). 

During  Year  2,  we  re- analysed  our  much  more  extensive  data,  from  all  1084  breast 
cancer  cases  and  2116  controls,  using  the  same  methods  as  in  Year  1 . 

Analyses  were  performed  with  a  variety  of  statistical  models,  of  increasing 
completeness  and  complexity: 

-  at  the  level  of  individual  SNP  loci:  associations  of  alleles  with  IGF-I  and  IGFBP-3 
concentrations  were  estimated  under  assumption  of  dominance,  codominance,  or 
recessiveness  of  alleles. 

-  at  the  level  of  full  2ene  loci:  individuals’  most  likely  haplotypes  were  estimated, 
and  individuals’  haplotype  combinations  (of  the  two  chromosomes)  were 
associated  with  IGF-I  and  IGFBP-3  levels;  in  further  analysis,  haplotypes  were 
transformed  into  SNP  genotypes  sorted  by  parental  phase,  and  generalized  linear 
models  (GLM)  were  used  to  relate  phenotype  (IGF-I,  IGFBP-3)  to  these  phased 
genotype  variables,  plus  their  possible  interaction  terms  within  a  gene  (following 
an  approach  described  by  Cordell  and  Clayton,  2001). 

During  Year  2,  we  have  also  been  working  further  on  refinements  of  the  statistical 
analysis  procedures,  implementing  several  new  procedures  (programmed  in  S  AS),  and 
testing  different  existing  programs  for  estimation  of  haplotypes  (‘Arlequin’; 

‘PHASE’;  ‘SAS-Genetics’;  TagSNP).  A  final  analysis  will  be  performed  when 
genotyping  of  all  41  SNPs  will  be  completed  (planned  to  be  complete  by  the  end  of 
2003).  Only  after  completion  will  it  be  possible  to  estimate  full  haplotypic  variation  at 
gene  loci,  and  to  perform  definitive  analyses. 

Preliminary  analyses  of  Year  1  suggested  that  up  to  some  20%  of  the  total  between- 
subject  variation  in  IGF-I  levels  might  be  explained  by  measured  genetic 
polymorphisms.  This  percentage  was  most  likely  overestimated,  due  to  a 
comparatively  small  study  size  and  over  fitting  of  statistical  models.  Our  current 
analyses,  based  on  much  larger  numbers  of  observations,  show  smaller,  but  probably 
more  realistic  effects  of  SNPs  and/or  haplotypes  on  serum  IGF-I  and  IGFBP-3  levels. 
Amongst  the  most  significant  associations  was  the  relationship  of  IGFBP-3  levels 
with  SNPs  in  the  IGFBP-3  gene,  which  has  also  been  observed  in  other  studies 
(Jemstrdm  et  al.,  2001). 
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(d)  Statistical  analysis  of  SNP  /  haplotvpe  associations  with  breast  cancer  risk  (Task 

8}: 

Preliminary  statistical  analyses  have  been  performed,  using  logistic  regression 
models  to  relate  breast  cancer  risk  to  SNP  genotypes  as  well  as  to  haplotypes.  The 
analytical  strategy  is  similar  to  that  for  the  analysis  in  relation  to  serum  IGF-I  and 
IGFBP-3  concentrations  (described  above).  In  addition,  we  have  implemented  a 
series  of  models  to  estimate  haplotype  relative  risks,  for  a  haplotype  ‘dosage’ 
estimated  as  posterior  probability  for  each  of  the  possible  haplotypes  given  SNP 
genotypes  (following  methods  described  by  Stram  et  al.,  2003). 

A  definitive  statistical  analysis  of  breast  cancer  risk  in  relation  to  SNP  genotypes 
and,  especially,  haplotypes  will  be  possible  only  after  completion  of  genotyping 
for  all  41  SNPs  (by  the  end  of  2003). 

KEY  RESEARCH  ACCOMPLISHMENTS 

Key  research  accomplishments  in  Year  2  were: 

-  Inclusion  in  the  study  of  274  additional  breast  cancer  cases  and  496  additional  control 
subjects,  bringing  the  total  number  of  cases  and  controls  to  1084  and  2116,  respectively; 

-  DNA  extraction  and  genotyping  of  a  first  series  of  23  SNPs,  for  all  cases  and  controls; 

-  IGF-I  measurements  for  all  cases  and  controls;  nearly  complete  measurements  of  IGFBP- 
3. 

-  Extensive  preliminary  statistical  analyses  of  associations  between  breast  cancer  risk, 
prediagnostic  serum  levels  of  IGF-I  and  IGFBP-3,  and  SNP  and  haplotype  data; 
Preparation  of  programs  and  S  AS  macros  for  all  analytical  steps,  ready  to  be  implemented 
when  all  laboratory  analyses  will  be  completed. 

PLAN  FOR  YEAR  3 

Plans  for  Year  3  are  as  follows: 

-  Completion  of  genotyping  of  the  1 8  additional  SNPs  identified,  and  selected  for  the  study; 

-  A  definitive  statistical  analysis  of  the  complete  SNP  genotype  and  haplot)^e  data,  in 
relation  to  serum  IGF-I  and  IGFBP-3,  as  well  as  breast  cancer  risk; 

-  Preparation  of  articles,  to  report  all  major  findings  in  scientific  journals. 

REPORTABLE  OUTCOMES 

None,  so  far.  First  drafts  of  articles  are  in  preparation;  these  will  be  finalized  after 
completion  of  all  laboratory  assays  for  genotypes  and  IGFBP-3,  and  definitive  statistical 
analysis. 
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CONCLUSIONS 


Our  study  is  ahead  of  schedule,  and  actually  has  been  expanded  to  include  a  larger  number  of 
SNP  assays  than  initially  foreseen  (made  possible  because  of  decreasing  genotyping  costs). 
Results  so  far  do  indicate  a  number  of  associations  of  IGF-I,  IGFBP-3  and  breast  cancer  risk 
with  SNP  genotypes  and/or  haplotypes  for  full  gene  loci.  The  overall  proportion  of  between- 
person  variance  in  circulating  levels  of  IGF-I  and  IGFBP-3  explained  by  the  genetic 
polymorphisms  is  modest,  however,  but  is  still  based  on  rather  incomplete  SNP  and  haplotype 
data. 

Elevated  serum  IGF-I  concentrations  show  a  modest  increase  in  breast  cancer  risk,  but  only 
among  postmenopausal  women. 

Final  statistical  analyses  of  associations  between  breast  cancer  risk  in  relation  to  IGF-I, 
IGFBP-3,  and  genetic  variants  will  be  performed  in  Year  3,  after  the  completion  of  the 
IGFBP-3  assays  and  additional  SNP  genotyping. 
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Figure  1.  Relative  risk  of  breast  cancer  by  quintiles  of  growth  hormone  levels 


hormone  quintile  Cases  OR  95%CI 
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Figure  2.  Relative  risk  of  breast  cancer  among  premenopausal  women 
by  quintiles  of  growth  hormone  levels 
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Figure  3.  Relative  risk  of  breast  cancer  among  postmenopausal  women 
by  quintiles  of  growth  hormone  levels 
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Figure  4.  Relative  risk  of  breast  cancer  among  women  younger  than  50  at  diagnosis  at  diagnosis 

by  quintiles  of  growth  hormone  levels 


hormone  quintaie  Cases  OR  95%CI 
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Figure  5.  Relative  risk  of  breast  cancer  among  women  older  than  50  at  diagnosis 
by  quintiles  of  growth  hormone  levels 


hormone  quintile  Cases  OR  95%CI 
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Figure  6.  Relative  risk  of  breast  cancer  among  women  younger  than  60  at  diagnosis 

by  quintiles  of  growth  hormone  levels 


hormone  qumble  Cases  OR  95%Ci 
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Figure  7.  Relative  risk  of  breast  cancer  among  women  older  than  60  at  diagnosis 
by  quintiles  of  growth  hormone  levels 


hormone  quintile  Cases  OR  95%CI 
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Table  3:  For  each  SNP,  numbers  of  observations,  means  of  IGFBP3  by  category  of  polymorphisms  (homozygotes  1*‘  allele,  heterozygotes,  homozygotes  2"**  allele),  p- 
values,  R-square  and  R-square  adjusted  of  different  models  with  IGFBP3  as  dependent  variable  (codominant  model,  dominant  I”*  allele,  dominant  2"*’  allele)  and 
average  excess  p-values  of  IGFBP3 _ _ _ _  ^ _ _ _ _ _  ' 
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Table  4:  Number  of  SNPs  per  gene,  R-square  adjusted  of  full  rank  haplotype  models  and  results  of  stepwise  regression  models 


G313  346  665  1011  1.019  0.762  1.362  0.8988  241.774  .  2980.28 
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